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ABSTRACT 

The initial shear field, characterized by a primordial perturbation potential, plays 
a crucial role in the formation of large scale structures. Hence, considerable analytic 
work has been based on the joint distribution of its eigenvalues, associated with Gaus- 
sian statistics. In addition, directly related morphological quantities such as ellipticity 
or prolateness are essential tools in understanding the formation and structural proper- 
ties of halos, voids, sheets and filaments, their relation with the local environment, and 
the geometrical and dynamical classification of the cosmic web. To date, most analytic 
work has been focused on Doroshkevich's unconditional formulae for the eigenvalues 
of the linear tidal field, which neglect the fact that halos (voids) may correspond to 
maxima (minima) of the density field. I present here new formulae for the constrained 
eigenvalues of the initial shear field associated with Gaussian statistics, which include 
the fact that those eigenvalues are related to regions where the source of the displace- 
ment is positive (negative): this is achieved by requiring the Hessian matrix of the 
displacement field to be positive (negative) definite. The new conditional formulae 
naturally reduce to Doroshkevich's unconditional relations, in the limit of no correla- 
tion between the potential and the density fields. As a direct application, I derive the 
individual conditional distributions of eigenvalues and point out the connection with 
previous literature. Finally, I outline other possible theoretically- or observationally- 
oriented uses, ranging from studies of halo and void triaxial formation, development 
of structure-finding algorithms for the morphology and topology of the cosmic web, 
till an accurate mapping of the gravitational potential environment of galaxies from 
current and future generation galaxy redshift surveys. 

Key words: methods: analytical, statistical — cosmology: theory, large-scale struc- 
ture of Universe — galaxies: formation 



1 INTRODUCTION 

The large-scale spatial distribution of dark matter, as revealed from numerical simulations, shows a characteristic anisotropic 
web-like structure. This cosmic web, which arises through the gravitational clustering of matter, is mainly due to the effects of 
the tidal field: in fact, the competition between cosmic expansion, the trace, and the traceless part of the tidal field imprints 
anisotropics in the large-scale matter distribution in much the same way that gravity and radiation pressure imprints baryonic 
acoustic oscillations (BAO) on the Cosmic Microwave Background (CMB) sky (Hu & Sugiyama 1995; Lee & Springel 2010). 
Hence, the initial shear field plays a crucial role in the formation of large scale structures, and a number of studies in the 
literature have been devoted to this subject - among the plethora of papers, see for example the classic works by Zeldovich 
(1970), Icke (1973), Peebles (1980), White (1984), Bardeen et al. (1986), Kaiser (1986), Bertschinger (1987), Bond & Myers 
(1996), Bond, Kofman & Pogosyan (1996) and van de Weygaert & Bertschinger (1996). In addition, if the cosmic web originates 
from primordial tidal effects and its degree of anisotropy increases with the evolution of the Universe, then studying this initial 
field is crucial in understanding the subsequent nonlinear evolution of cosmic structures (Springel et al. 2005; Shandarin et 
al. 2006; Desjacques 2008; Desjacques & Smith 2008; Pogosyan et al. 2009), the alignment of shape and angular momentum 
of halos (West 1989; Catelan et al. 2001; Lee & Springel 2010; Rossi, Sheth & Tormen 2011), the statistical properties of 
voids (Lee & Park 2006; Platen, van de Weygaert & Jones 2008), and more generally for characterizing the geometry and 
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morphology of the cosmic web (Shen et al. 2006; van de Weygaert & Bond 2008; Forero-Romero et al. 2009; Aragon-Calvo et 
al. 2010a,b; Shandarin et al. 2010). 

The basic theory for the formation and evolution of structures is now well-understood, thanks to the pioneering work of 
Doroshkevich & Zeldovich (1964), Doroshkevich (1970), Zeldovich (1970), and Sunyaev & Zeldovich (1972) - the latter in the 
context of galaxy formation. In particular, Doroshkevich (1970) derived the joint probability distribution of an ordered set of 
eigenvalues in the tidal field matrix - at random positions - given the variance of the density field, corresponding to a Gaussian 
potential; we will refer to it as the unconditional probability distribution of eigenvalues. In addition, Zeldovich (1970) provided 
the fundamental understanding of anisotropic collapse on cosmological scales, and recognized the key role of the large scale 
tidal force in shaping the cosmic web. Subsequently, Doroshkevich and Shandarin (1978) calculated some statistical properties 
of the maxima of the largest eigenvalue of the shear tensor. Their study proved that the most probable formation process 
starts first with a one-dimensional collapse (cosmic pancake formation) . The directions (orientations) for the one-dimensional 
collapsed sheets are determined by the largest eigenvalue of the deformation tensor, which can be attributed to the initial 
linear density perturbations; moreover, the probability that two or even three of the initial eigenvalues are identical or nearly 
equal is extremely small, indicating that the collapse is triax;ial. Later on, following a constrained field approach pioneered 
by Bertschinger (1987), van de Weygaert & Bertschinger (1996) developed an algorithm for setting up tailor-made initial 
conditions for cosmological simulations, which addressed the role of the tidal fields in shaping large-scale structures. In the 
same period. Bond, Kofman & Pogosyan (1996) developed a cosmic web theory which naturally explains the filamentary 
structure present in the Cold Dark Matter (CDM) cosmology, due to the coherent nature of the primordial tidal field. They 
realized that an "embryonic" cosmic web is already present in the primordial density field, and explained why in overdense 
regions sheet-like membranes are only marginal features. Since then, because of the correspondence between structures in the 
evolved density field and local properties of the linear tidal field pointed out by the same authors, the statistics of the shear 
has received more attention in the literature. For example, Lee & Shandarin (1998) computed some probability distributions 
for individual shear eigenvalues and obtained an analytic approximation to the halo mass function, and Catelan & Porciani 
(2001) explored the two-point correlation of the tidal shear components. 

However, a variety of studies in the physics of halo formation and cosmic web classification are based on Doroshkevich's 
unconditional formulae for the ordered eigenvalues of the initial shear field associated with Gaussian statistics (Doroshkevich 
1970), but those formulas cannot differentiate between random positions and peak/dips as they neglect the fact that halos 
(voids) may correspond to maxima (minima) of the density field. According to Bardeen et al. (1986), if one assumes the 
cosmological density fiuctuations to be Gaussian random fields, the local maxima of such fields are plausible sites for the 
formation of nonlinear structures. Hence, the statistical properties of the peaks can be used to predict the abundances and 
clustering properties of objects of various types, and in studies of the non-spherical formation of large-scale structures. The 
study of Bertschinger (1987) goes in this direction, by generalizing the treatment of Bardeen et al. (1986) and proposing a 
path integral method for sampling constrained Gaussian random fields. The method allows one to study the density field 
around peaks or other constrained regions in the biased galaxy formation scenario, but unfortunately it is too elaborate and 
inefficient in its implementation. Instead, by applying the prescription of Hoffman & Ribak (1991) to construct constrained 
random fields, van de Weygaert & Bertschinger (1996) were able to show that it is possible to generate efficiently initial 
Gaussian random density and velocity fields, and specify the presence and characteristics of one or more peaks and dips at 
arbitrary locations - with the gravity and tidal fields at the site of the peaks having the required strength and orientation. 
Some other studies on constrained initial conditions were also pursued by van Haarlem & van de Weygaert (1993), and by 
van de Weygaert & Babul (1994). The idea has been expanded in Bond & Myers (1996), who presented a peak-patch picture 
of structure formation as an accurate model of the dynamics of peaks in the density field. Their approach goes further, as 
it involves the explicit formalism for identifying objects in a multiscale field (i.e. it does not restrict to a single scale). The 
model is even more precise for void patches, the equivalent framework for studying voids (Sahni et al. 1994; Sheth & van 
de Weygaert 2004; Novikov, Colombi & Dore 2006; Colberg et al. 2008). In general, density peaks define a well-behaved 
point-process which can account for the discrete nature of dark matter halos and galaxies, and on asymptotically large scales 
are linearly biased tracers of the dark matter field (Desjacques & Sheth 2010). 

Therefore, it would be desirable to incorporate the peak (or dip) constraint in the statistical description of the initial 
shear field, in order to characterize more realistically the geometry and dynamics of the cosmic web. The main goal of this 
paper is to do so, by providing a set of analytic expressions which extend the work of Doroshkevich (1970) and Bardeen et 
al. (1986), and are akin in philosophy to that of van de Weygaert & Bertschinger (1996). This is achieved by constraining the 
Hessian of the displacement field (the matrix of the second derivatives) to be positive (negative) definite, which is the case in 
the vicinity of minima (maxima) of the source of the displacement field. The new conditional probability distributions derived 
in this study include the correlation between the potential and density fields through a reduced parameter r, and naturally 
recover Doroshkevich's (1970) unconditional formulae in the absence of correlation. 

Hence, the main focus of this work is to derive explicitly the joint probability distribution of the eigenvalues of the shear 
field, given the fact that positions are peaks or dips in the corresponding density field - and not random locations. In this 
sense the field is termed constrained (i.e. it has constrained eigenvalues, which are the result of looking only at peak/dip 
regions), and it is statistically described by a conditional probability distribution. Note that even though the formalism is 
essentially restricted to one scale, the formulae derived here are useful in a variety of applications - some of which will be 
discussed at the end of this paper. 

The layout is organized as follows. Section [2] provides the derivation of the new analytic expressions for the conditional 
distributions of eigenvalues of the initial shear field. In particular. Section [2.11 illustrates the basic notation adopted; Section 
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1 2. 2 1 contains the derivation of the joint distribution of eigenvalues in the peak/dip picture, while Section [2. 31 shows the reverse 
conditional distribution function. As a direct application of the new formulae, Section|3]presents the individual distributions of 
eigenvalues subjected to the extremum constraint, along with some other related conditional probabilities. This part extends 
previous work by Lee & Shandarin (1998), which is briefly summarized in Appendix [X] and complements the study of van de 
Weygaert & Bertschinger (1996). Finally, Section |4] highlights the main results and discusses ongoing and future applications, 
which will be presented in forthcoming publications. 



2 JOINT DISTRIBUTION OF EIGENVALUES AT PEAK/DIP LOCATIONS 

In this section, I first introduce the basic notation adopted throughout the paper. I then derive the conditional joint distribution 
of eigenvalues of the tidal field, under the constraint that the Hessian matrix of the source of the displacement is positive (or 
negative) definite. Finally, I present the reverse probability function, useful in relating this work with that of Bardeen et al. 
(1986) and van de Weygaert & Bertschinger (1996). 



2.1 Basic notation 

Let ^' denote the displacement field, $ the potential of the displacement field, S** the source of the displacement field. Indicate 
with q the Lagrangian coordinate, with x the Eulerian coordinate, where 

x(q) = q + *(q). (1) 

Use Tij to denote the shear tensor of the displacement field, Hij for the Hessian matrix, for the Jacobian of the displacement 
field = 1,2,3). Clearly: 

-^ij(q) = |^ = '5ij+rij (2) 

'■^ - dqj dq^dq; 

- 1^ (4) 

The eigenvalues of are Ai,A2,A3, those of Hy^ axe 5i,^2,C3- In general, the ordering of the eigenvalues is assumed to be 
such that Al ^ A2 ^ A3. Let the potential "I> be a Gaussian random field determined by the power spectrum of matter density 
fluctuations P{k), with k denoting the wave number and W{k) the smoothing kernel. The density field described by the source 
S\s/ is also a Gaussian random field. The correlations between these two fields are expressed by: 

2 

{TijTki) = -j|-(5ij<5ki + 5ik5ji + 5ii5jk) (6) 

(J/ijJi-M) = ^ (SiAi + Sii^Sii + 5ii5jk) (7) 

(Tiji/ki) = ^{SiAi + -^ik^ji + 5ii5jk) (8) 
where = S2 = erg, a'^ — Se = o"2, Fth = — S4 = — (Ti, Sij is the Kronecker delta, and 



= j ■P(fc) W\k)Ak (9) 

Tij and are real symmetric tensors, so they are specified by 6 components (note however that only 5 are independent once 
the height of the density peak or dip has been set, because the trace of the shear field tensor is entirely constrained by the 
local density field value via the Poisson equation). Label them as A = (1, 1), B = (2, 2), C = (3, 3), D = (1, 2), E — (1, 3), 
F = (2, 3); the symbols a or /3 indicate the various couples, in a compact notation, where ct, j3 = A, B, C, D, E, F. For example. 
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if a = D then Ta = To = T12, and so forth. In the subsequent derivations, for clarity all the various dependencies on cr's are 
dropped. This is done by introducing the "reduced" variables T and H, defined as 

Ta = Ta/aT, Ha — Ha/au (11) 

and the "reduced" correlation 

^2 

r = rTH/o-TO-H = — =^ -7 (12) 

where 7 is the same as in Eq. (4.6a) of Bardeen et al. (1986). In this notation, the eigenvalues of T and of H are A^'s and 
^i's, respectively, with i = 1,2, 3. Of course, the a dependence can be restored at any time, if desired, by using the previous 
Equations (fTTj) and ([12}. 



2.2 Joint conditional distribution of eigenvalues in the peak/dip picture 

The main goal of this section is to derive the joint distribution of the eigenvalues of when the ordering is Ai ^ A2 ^ A3, in 
regions where the source of displacement 5** is a minimum (or maximum). Being a minimum (maximum) of S\sj implies that 
the displacement field is such that the gradient of 5* is zero, and the Hessian matrix Ha is positive (or negative) definite. In 
block notation, the covariance matrix V of the 12 reduced components of y = (T, H) is: 



where 



(HpTa) (HpHp)^ 



^=(0 7).S=|1 3 1| (14) 




with / a (3 X 3) identity matrix and a (3 x 3) null matrix. The inverse of the reduced covariance matrix is simply 

15 / A-^ -r. 
(1 - r2) \-rA'^ A 

where 



c = i- - 77^ (-f,'-. ''S') (15) 



The joint probability of observing a tidal field T for the gravitational potential and a curvature H for the density field (from 
now on, the understood indices a and /? are dropped) is a multivariate Gaussian in Y = {T,H): 

— — -C Y 

p{Y\r)dY = — ]=dY = p{f, H\r)dfdH. (17) 



i27v)<^^\V\ 

Therefore 

- - fi,^nPiT,H\r) dH ff,^,^p(H) ■ p(f\r,H) dH 

p(H>0) Jf^^^p{H)dH 

- - v(f iilr) v(f H\r) -i(t-b)T..^-i.(T-b) 

p(T\r, H) = ' ' ^ = ^K' ^> - = = . (19) 

p{H) S^p{T,H\T)dT {2^Y^A^\ 

The marginal distributions p(T) and p(H) axe simply multidimensional Gaussians with covariance matrix j4/15, and therefore 
they can be expressed using Doroshkevich's formulae as 

= 3 



where 



P{f) = -l|-^e-S(^'^-«^^) (20) 



with 



fci = rn + r22 + r33 (21) 

k2 = fiiT 
and similarly 



k2 = f\\f 22 + T11T33 + T22T'33 — fi2 — Tx3, — ^23 (22) 
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with 

/ii = H11 + H22+H33 (24) 

/l2 = H11H22 + H11H33 + H22H.i3 — H12 — H\3 — H23- (25) 

The conditional distribution p{T\r,H) is also a multidimensional Gaussian, with mean b and covariance matrix , where 

h = rH, = —(1 - r^)A. (26) 
15 

Carrying on the calculation indicated in (|19p yields: 

rHii) + (faa - rH22) + (T33 - rHas) = fci - r/ii 
- rHii){f22 - r//22) + (Til - rHii){f33 - rJ/ga) 

rH22){f33 - rH33) - ifi2 - rHuf - (Tis - rHr3f - (T23 - rJ/as)^ = + r^/12 - r/iifci + rrj (28) 



where 




ifi = 


(Til 


K2 = 


(Til 


+ 


(T22 


and 





= Tii/fii + T22H22 + T33H33 + 2Ti2-ffi2 + 2ri3_ffi3 + 2r23-H'23 (29) 

accounts for cross-correlation terms. 

The previous expression (|27|) generalizes Doroshkevich's formula (|20|1 to include the fact that halos (voids) may correspond 
to maxima (minima) of the density field. Note that, if r = 0, Doroshkevich's formula (|20|l is indeed recovered - as expected in 
the limit of no correlations between the potential and density fields. Equation H27|l is written in a Doroshkevich-like format, 
and it is one of the key results of this paper. 

It is also possible to express (|27p in terms of the constrained eigenvalues C,'s of the {T\r,H) matrix. The result is: 



P(Ci,C2,C3|r) = p(Ai,A2,A3|r,Ci,C2,6) (30) 

15=* 1 r : 



8^7r(l-r2)3''''PL 2(1 -H)' 
where in terms of constrained eigenvalues: 

= Ci + C2 + Ca = fci - r/ii (31) 

K2 = C1C2 + CiCa + C2C3 = k2+ r^/i2 - r/iifci + rj? (32) 

■n = Aifi + A2I2 + A3I3 (33) 

fci = Ai + A2 + A3 (34) 

k2 = A1A2 + A1A3 + A2A3 (35) 

hi = |i+|2+|a (36) 

h2 = I1I2 + Ilia + |2|a (37) 
with the partial distributions expressed by Doroshkevich's unconditional formulae as 

p(Ai, A2, A3) = ^^e-i(2'^i-'''^^'(Ai - A2)(Ai - A3)(A2 - A3), (38) 
8v57r 

p(|i, I2, 13) = -ij-e-i(^'^?-5''^)(|i - |2)(|i - |3)(|2 - la). (39) 

and 

Ci = (Ai|r,|0 = Ai-r|i (40) 



-{2Kl - ^K2)] ■ KCi - C2)(Ci - C3)(C2 - Ca)! 



which is obvious from (|28p . with Ai's and ^i's the corresponding unconstrained eigenvalues of the matrices T and H. Equation 
H30p is another key result of this work. This relation is easily derived from (|27|) because the volume element in the six- 
dimensional space of symmetric real matrices is simply given, in terms of constrained eigenvalues, by 

Y[ A{f^\T, H^)^\ (Ci - C2) (C2 - Ca) (Ci - Ca) IdCid^dCadfisa (41) 

where dfigs is the volume element of the three-dimensional rotation group 50(3), and also of the three-sphere (see Appendix 
B in Bardeen et al. 1986). In addition, from (|28|) it is direct to see that one has Tli = Ai and Ha = in the system where 
both T and H are diagonal; therefore Ki = X]i(Ai — r^i) = "^^i Q, with Ci the constrained eigenvalues. 

The fact that the system in which T is diagonal is also the system in which H is diagonal has a simple geometrical 
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explanation: in order to obtain the conditional distribution {T\r,H), a standard procedure for multivariate Gaussians (when 
transforming from uncorrelated to correlated variates) is to use a Cholesky decomposition. However, this decomposition does 
not sample symmetrically with respect to the principal axes. An alternative method which takes care of the alignment with 
the principal axes (and used here) is the following: decompose spectrally the covariance matrix of {T\r,H), A, i.e. find the 
eigensystem D and diagonal eigenvalue matrix A of the covariance matrix such that A = D ■ A - ; then set W = D ■ A^^^Y , 
with Y a set of independent zero mean unit variance Gaussians. It is direct to show that W is indeed the correct covariance 
matrix, and this scheme takes care of the alignment with the principal axes. 



2.3 The reverse joint conditional distribution of eigenvalues 



Equation (|18|) expresses the probability of observing a tidal field T in regions where the curvature H of the density field is 
positive/negative (i.e. peak or dip regions). One may be also interested in the reverse joint conditional distribution, namely 
p(i/|r, T > 0). Obtaining its expression is easily achieved from the previous algebra by applying Bayes' theorem, so that: 

^,H\r f > 0) = ^T>oPiH.f\r) AT ^ U^^pjf) df ■ piH\r,f) 
p{f>0) h^,P{T)Af 

where now (symmetrically) 

viH f\r) n(H f\r) -i(H-b')T..i^-i.(H_b') 

p(H\r, T) = 'f ^ ' = ' J ' . = = . (43) 

p{T) J^p{H,T\r)AH {2Ti)'i,/\^\ 

The distribution p{H\r,T) is also a multidimensional Gaussian with mean b' and covariance matrix , where 

h' = rf, = r^)A. (44) 

Along the lines of the previous calculation, it is direct to obtain 

IS'^ 1_ 

16^7r3 (l-r2)3-^""L 2(1 

where in analogy with H27p one has now: 

Hi = (i^ii-rrii) + (//22~rT22) + (Jy'33-rf33) = /ii-rfci 
H2 = {Hii - rfii){H22 - rf22) + {Hii - rfii){H33 - rfaa) 

+ {H22 - rf22){H33 - rfss) - {H12 - rTi2)' - {H13 - rf^sf - {H23 - rf 23? = + r'fe - rfeifti + rr?. (46) 

In addition, if £1,62,63 are the constrained eigenvalues of {H\r,T), then 

p(£i,e2,e3ir) = p(|i, I2, ^k, Ai, A2, A3) (47) 
15^ 1 



8V57r(l-r2)3''''PL 2(1- 
where in terms of constrained eigenvalues 

Hi = ?i + ?2 + ?3 = fti - rki (48) 
H2 = 6162 + 6163 + 6263 = /i2 + r^fe — r/iifci + r7?, (49) 

with the 61 's simply given by 

6-i = (|i|r,Ai)=fi-rAi. (50) 

Note an interesting symmetry: Ki, Hi, fci, hi are always the traces of their corresponding matrices. 

New analytic formulae derived from (|45p and (|47p . which generalize the work of Bardeen et al. (1986), will be presented in 
a forthcoming publication. The previous relations are also useful in making the connection with the work of van de Weygaert 
& Bertschinger (1996). For example. Equation (|50|) confirms their finding of the correlation between the shifted mean values 
for density and shear (see for example Equation 108 in van de Weygaert & Bertschinger 1996, which provides the reverse 
mean value of the shear given a density field shape, and their Section 4.4 for more details). 



-{2H! - 5H2)] ■ |(6~i - 6-2) (6-1 - 63) (6-2 - 6-3)1 



3 INDIVIDUAL CONDITIONAL DISTRIBUTIONS AND PROBABILITIES 

A direct application of the main formulae derived in Section 12.21 is to compute the individual distributions of eigenvalues 
(along with some other related conditional probabilities), given the peak/dip constraint. Starting from Equation (I30p . with 
some extra complications due to the inclusion of correlation terms, it is possible to extend the work of Lee & Shandarin (1998) 
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Figure 1. Individual conditional distributions p{Ci\r) of the initial shear field in the peak/dip picture fEguations 1531 [54l and [55)l . for 
different values of the reduced correlation parameter r — as indicated in the panels. 



to account for those regions where the source of the displacement is positive (negative) - see the Appendix |A] for a compact 
summary of their results. Introducing the following notation: 

1125 



I = 



I' 



L'{x,y\r) 
N'{x,y\r) 



/• (l-r^)-"^/' 
\/3^/12 

V^-r^ (x - y){3x - t/) ■ exp[- JY~:2^{^^ - xy + 

{x - y) [8(1 - r') + 3{3x - y){3y - x)] ■ exp [ - (Sx^ - 2xy + 3y^)] • erfc [-^ ^ 



(51) 



4 



-.{x - 3y) 



and dropping the understood ~ symbol (but note that the eigenvalues are still "reduced variables"), the two-point probability 
distributions of the constrained eigenvalues of the shear field are given by: 



p{CiX2\r) = i'[L'(Ci,C2|r) + n-iV'(Ci,C2lr)] 
p{C2,Ca\r) = r[L'(C3,C2|r) + n-iV'(C3,C2|r)] 

P(Ci,C3|r) = l'{L'{Cr,C3\r) + L'{C3Xi\r) + n- [iV'(Ci, Cair) + iV'(C3, Ci|0] }• 



(52) 



Performing the various partial integrations, it is straightforward to obtain: 



p{Ci\r) = /'{ i'(Ci,C2|r)dC2 + n iV'(Ci,C2lr)dC2} 



(53) 



\/5 r 20 



V5 r 

127r I. 



(l-r2) 



Ci exp 



9C? 



27r 



2(l-r2)y (l_j.2)3/2 



exp 



5C: 



2(1 -r2) 



72Ci 



:)[(l-r^)-20C 



3V37r 



(- ^5Cf \ 



4(1 -r2). 



erfc I — 



L'(Ci,C2|r)dCi+n iV'(Ci,C2|r)dCi} 



^ 1 

20Fyr^^ 



(54) 



exp 



r 15 C2 ] 

L 4 (l-r2)J 



KCslO = /'I / 'L'(C3,C2|r)dC2+n / 'iV'(C3,C2lr)dC2| 

V5 r 20 / 9C| \ V2^ 

127rL(l-r2)^^ ^''Pl 2(1 -r2)/ (1 _ r2)3/2 

3^3^ 



exp 



15Cl 



1 6rfc ( 

4(1 -r2)/ \2VT^ 



V2^/^^72/J 



' M(i-^')-2ocl] 



(55) 




Figure 2. Conditional probability distribution p{Ki\r,(^3 > 0) = p(A|r, ^3 > 0) given by Equation l|61|l . for different values of r, as 
indicated in the panels. Solid vertical lines show the maximum of each distribution (Kf^'^^ ~ 1.5 Vl — r'^), dotted lines represent their 
corresponding average values as given by Equation l|62|l . 



Plots of these distributions are shown in Figure [U for different values of the reduced correlation r. Note the symmetry 
between p{(i\r) and ^(Csl''), evident from H53|) and H55|) . In particular, p{C^2\r) is simply a Gaussian with variance 
and so the conditional distribution p(A2|r, ^2) is a Gaussian with the same variance and shifted mean r^2 (recall that ("2 — 
A2 — ^"^2). In addition, it is easy to show that, with A = i^i + (^2 + Cs = Ki, one obtains 

•"(i.&W = l||(T^;3j5[l5A'-90A& + 135C|-8(l-r»)] . „p [ - (3A' - lOAC. + MC? )] 

By integrating the previous equation over ^3, it is direct to verify that p(A|r) = p{Ki\r) is a Gaussian with zero mean and 
variance (1 — r^), namely: 

P(A1.) = ^7=^ • exp[ - = Pmr). (57) 

Since Ki = fci — rhi (i.e. Eg. ISlf) . the previous expression implies that p{ki\r,hi) is therefore a Gaussian with mean rhi and 
variance (1 — r^). In addition: 

p(Ci>0|r) = |, (Cilr) = ^v/r^, 4,, = 15^(1 2) (58) 

P(C2>0|r) = i (C2|r)=0, = (59) 

2 3 r J 2 IStt - 27 

Moreover, the probability distribution of A confined in the regions with (^3 > is: 
75V5 A / 9 A 



P(C3>0lr) = ^, (C3|r) = -^\/r^, 4|, = i£^^(l-^2)^ (go) 



Stt (l-r2) "^""PV 8(1 
25 1 / A^ \ r ./ VWA \ J yiOA 



/ _ 9 A^ \ 
V 8fl-r2)J 



' 4^yr"^ '''''^v 2(i-r2)y r"V4yr^^^ ' ^''V2^/]~^ 

Using the previous formula, it is direct to prove that 



(A|r, Ca > 0) = |ffi£(3^ _ 2) ~ 1.6566 ^J\-T^. (62) 

144y TT 

Figure [2] shows the conditional probability distribution p(K\\r, (^3 > 0) = p(Ajr, (^3 > 0) (i.e. Equation [M}, for different values 
of r. The dotted vertical lines display the average values of those functions, as given in ((62]), while the vertical solid lines 
indicate their maximum values. In fact, it is easy to see that the maximum of p(A|r, (^3 > 0) is reached when A ~ 1.5 \j\ — r"^. 
This result is readily obtained by computing the derivative of p(A|r, (^3 > 0), and by finding the corresponding zeroes. An 
example of this calculation is shown in Figure O where an arbitrary value of r = 0.5 is considered. 
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Figure 3. Determination of tiie maxima of l|61|l . Tfie derivative |C3 > 0, r) is shown, for an arbitrary value of the reduced correlation 

r, i.e. r = 0.5. The maxima correspond to those points where iC™^'^ ~ 1.5 -^1 — r^, indicated with solid vertical lines in the figure. 



Finally, the probability that a given region with density A will have all positive eigenvalues ('s is: 

p(A,C3>0|r)_ 3\/T0 A / 5 A^ \ 1 r / VWA \ / VWA 



„(C3 > OIA r) - P(^'^^>^\'-) - 3V10 A / _ 5 A- n 1 r ^ yiQA x 



p{A\r) 4^ V 8(l-r2)y 21 \aVT^^ \2VT~i 

Note that, in absence of correlations between the potential and density fields (i.e. when r = 0), all the previous expressions 
reduce consistently to the unconditional limit of Lee & Shandarin (1998) - see again Appendix [X] Several other results on 
probability distributions along these lines were also already derived by van de Weygaert & Bertschinger (1996): here we 
confirm their findings that the inclusion of the condition for a peak or dip in the Gaussian density field involves a l/vT^-^ 
factor, and that the conditional distributions have shifted mean and reduced variance (see again their Section 4). 



4 CONCLUSION 

Since the initial shear field associated with Gaussian statistics plays a major role in the formation of large scale structures, 
considerable analytic work has been based on the joint distribution of its eigenvalues - i.e. Doroshkevich's formulae (Equation 
I20p . However, Doroshkevich's equations neglect the fact that halos (voids) may correspond to maxima (minima) of the density 
field. The main goal of this work was to provide new analytic expressions, in the context of the peak/dip picture (Bardeen et 
al. 1986; Bond et al. 1991), which include the fact that the eigenvalues of the linear shear field are related to regions where the 
source of the displacement is positive (negative). These new conditional probabilities, derived in Section[2l are Equations (|27p . 
H30p . (|45p and (|47|) : they represent the main results of this paper. Written in Doroshkevich-like format, they naturally reduce 
to Doroshkevich's (1970) unconditional relations in the limit of no correlation between the potential and the density fields 
(i.e. when r = 0). As a first direct application of pop . the individual conditional distributions of eigenvalues were obtained in 
Section [S] these relations extend some previous work by Lee & Shandarin (1998) - see Appendix for a compact summary 
of their results. Much more analytic work can be carried out using these new formulae, especially in connection with the 
statistics of peaks developed by Bardeen et al. (1986): their calculations can be extended within this framework (see Section 
I2.3[l . and results of this extension will be presented in a forthcoming publication - along with more insights on the equations 
derived in Section [2.21 

To obtain the conditional distribution p{T\r,H > 0) (i.e. Equation I18p . in principle one needs to sample numerically 
the probability (|19|) as done in Lavaux & Wandelt (2010). However, the new analytic results of this paper suggest a simpler 
generalized excursion set algorithm, which will be also presented in a forthcoming study. The algorithm allows for a fast 
sampling of (|19p . and permits to test the new relations derived here (i.e. Sections [2] and [S] and Appendix IX)) against mock 
data. In addition, along these lines it is possible to extend the main shape distributions involved in the triaxial formation of 
nonlinear structures (i.e. ellipticity e, prolateness p, axis ratios v and /i, etc.). Halos and voids are in fact triaxial rather than 
spherical (i.e. Rossi, Sheth & Tormen 2011), and the initial shear field plays a crucial role in their formation. For example, 
in the ellipsoidal collapse framework the virialization condition depends on ellipticity and prolateness, which are directly 
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related to the eigenvalues of the external tidal field. Starting from p{(i, (2, (air), it is then straightforward to turn this joint 
conditional distribution into p{e,p, S\r) - to include the peak constrain, - and then to characterize p{e\r), p{p\r), p(/i|r), p{i^\r) 
and so forth. Applications to halos and voids will be also presented next, including implications for the skeleton of the cosmic 
web. The extension to non-Gaussian fields, along the lines of Lam et al. (2009), is a natural follow-up of this work and will be 
presented in a forthcoming publication as well. In the context of cosmic voids, other interesting uses of the new formulae (|27p . 
(|30p . H45p and (|47|) involve the Monge-Ampere-Kantorovitch reconstruction procedure - see for example Lavaux & Wandelt 
(2010). 

The analytic framework described here can be also useful in several observationally-oriented applications, and in particular 
for developing algorithms to find and classify structures in the cosmic web. For example. Bond, Strauss & Cen (2010) presented 
an algorithm that uses the eigenvectors of the Hessian matrix of the smoothed galaxy distribution to identify individual 
filamentary structures. They used the distribution of the Hessian eigenvalues of the smoothed density field on a grid to study 
clumps, filaments and walls. Other possibilities include a web classification based on the multiscale analysis of the Hessian 
matrix of the density field (Aragon-Calvo et al. 2007), the skeleton analysis (Novikov et al. 2006), as well as a morphological 
(Zeldovich-based) classification (for instance Klypin & Shandarin 1983; Forero- Romero et al. 2009). More generally, the fact 
that the eigenvalues of the Hessian matrix can be used to discriminate different types of structure in a particle distribution 
is fundamental to a number of structure-finding algorithms (Forero-Romero et al. 2009), shape-finders algorithms (Sahni et 
al. 1998), and structure reconstruction on the basis of tessellations (Schaap & van de Weygaert 2000; Romano-Diaz & van 
de Weygaert 2007), etc. In addition, the classification of different environments should provide a framework for studying the 
environmental dependence of galaxy formation (see for example Blanton et al. 2005). To this end, recently Park, Kim and 
Park (2010) extended the concept of galaxy environment to the gravitational potential and its functions - as the shear tensor. 
They studied how to accurately estimate the gravitational potential from an observational sample finite in volume, biased 
due to galaxy biasing, and subject to redshift space distortions, by inspecting the dependence of dark matter halo properties 
on environmental parameters (i.e. local density, gravitational potential, ellipticity and prolateness of the shear tensor). It 
would be interesting to interpret their results with the theoretical formalism developed here, and ultimately to study the 
gravitational potential directly from a real dataset such as the SDSS Main Galaxy sample within this framework. 

The formalism presented in this paper is restricted to one scale (i.e. peaks and dips in the density field, as in Bardeen 
et al. 1986), but the extension to a multiscale peak-patch approach along the lines of Bond & Myers (1996) is duable and 
subject of ongoing work. This will allow to account for the role of the peculiar gravity field itself, an important aspect not 
considered here but discussed for example in van de Weygaert & Bertschinger (1996). In fact, these authors introduced the 
peak constraints to describe the density field in the immediate surroundings of a peak, and then addressed the constraints on 
the gravitational potential perturbations; in particular, they constrained the peculiar gravitational acceleration at the position 
of the peak itself, in addition to characterizing the tidal field around the peak. Including all these effects in our formalism is 
ongoing effort. Finally, the interesting and more complex question of the local expected density field alignment/orientation 
distribution as a function of the local field value (or the other way around - see Bond 1987; Lee & Pen 2002; Porciani et al. 
2002; Lee 2011) can be addressed within this framework, and is left to future studies. 
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APPENDIX A: INDIVIDUAL UNCONDITIONAL DISTRIBUTIONS AND PROBABILITIES 

Starting from Doroshkevich 's formulae for the unconditional distribution of the ordered eigenvalues Ai ^ A2 ^ A3 (Eq. [38]), 
Lee & Shandarin (1998) derived a set of useful probability functions. Their expressions can be considerably simplified by using 
various symmetries, and by introducing the following notation: 

1125 ,,,, 
I = ^ (Al) 



37r/12 

3 



L{x,y) = [x -y){Sx -y) ■ exp^~3(^x'^ - xy + -y^^^ 
N{x,y) = {x - y)[S + 3{3x - y){Sy - x)] -expl^- ^(Sx'^ - 2xy + 3y^^'^ ■ erfc [^-^(x - 3y)] . 
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Dropping the understood ~ symbols for the eigenvalues (although all the quantities are still "reduced", i.e. the a dependence 
is not shown), the two-point probability distributions are: 

p(Ai,A2) = ;[L(Ai,A2) + n-iV(Ai,A2)] (A2) 

p(A2,A3) = z[L(A3,A2)+n-iV(A3,A2)] (A3) 

p(Ai,A3) = z{L(Ai,A3) + i(A3,Ai)+n- [iV(Ai,A3) + iV(A3,Ai)]}. (A4) 
Performing the various integrations: 

p(Ai) = l[ f ' L{\i,X2)d\2+n f ' N{\i,\2)dX2} (A5) 

J —oo J — oo 

= ^ [2OA1 exp(^ - ^Aij - exp(^ - ^A? jerfc(-V2Ai)(l - 20Ai) + 3^3^ exp(^ - -^A?^erfc(^ - ^Ai 



L(Ai,A2)dAi+n / iV(Ai, A2)dAi| (A6) 
715 r 15 ,2] 



p(A3) = 'L(A3,A2)dA2+n^ 'iV(A3,A2)dA2} (A7) 

[2OA3 exp(^ - ^A^) + V2^ exp( - ^A^) erfc(^A3)(l - 20A^) - 3^ exp( - -^A^^erfc^^As)] . 



I \ 

_ __/5_ / 9 ,2\ , . / 5 ,2\ . r _/ r^r, , 2 N o _ / 15 ,2\ . r _ /a/S 

127r 

Note the symmetry between p(Ai) and p(A3). In addition, with 5 = Ai + A2 + A3 = ki, one gets 

Pi^' ^3) = ^ [155" - 905A3 + 135A^ - s] ■ exp[ - |(3<5' - WSXs + 15A^)] + ^exp[ - 3(5" - 5<5A3 + yA^)] . (A8) 

By integrating the previous expression over A3, it is direct to verify that p{S) is a zero- mean unit- variance Gaussian distribution. 
In addition: 

n\ 23 , , 3 2 IStt — 27 / . ^\ 

P Ai > = — , Ai = -j=, a^^ = — A9 

25 VlOvr 3U7r 

P(A2>0) = i, (A2) = 0, '^1 = ^ (AlO) 



P(A3>0) = -, (A3) = -^=, < = ^7^ • (All) 

25 VlOvr 307r 

Moreover, the probability distribution of 5 confined in the regions with A3 > is: 
p(A3 > 0) p(A3 > 0) 



75^5^ / 9,2A , 25 / ^^^ ./^<5\1 



Stt V 8 / 4V2^ 

Using the previous formula, it is direct to show that 

(5iA3 > 0) = ^^^(3\/6- 2) ~ 1.6566. (A13) 
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It is also easy to see that the maximum of p(5|A3 > 0) is reached when 5 ~ 1.5: this can be readily achieved by computing 
the derivative of the previous expression and by finding the corresponding zeroes, as done in the main text (see Figure 
Finally, the probability that a given region with density S will have all positive eigenvalues is: 

p{S, \;i>0) 3^/10, / 5,2\ If r^VW5\ ^^VWS\i 
KAa >0\S) = =-^S exp( - -S^) + - [erf (^) + erf (^)J . 



